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REFERENCE TO A COMPUTER PROGRAM APPENDIX 

Not Applicable 

BACKGROUND OF THE INVENTION 
1. Field of the Invention 

The present invention generally pertains to computing routes within a network, 
and more particularly to a routing algorithm for computing multiple loop-free routes 
between each source-destination pair. 
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2. Description of the Background Art 

The most popular routing protocols used in today's internets are based on the 
exchange of vectors of distance, such as RIP and EIGRP; or topology maps, such as 
OSPF. It should be noted that RIP and a number of similar routing protocols which are 
5 based on the distributed Bellman-Ford algorithm (DBF) for shortest-path computation, 
suffer from the bouncing effect and counting-to-infinity problems, which limit their 
applicability to small networks using hop count as the measure of distance. While 
OSPF and algorithms based on topology-broadcast are hindered by excessive 
communication overhead, which forces the network administrators to partition the 
fi network into distinct areas which are interconnected by a backbone. As a result the use 
5 of OSPF leads to a complex solution, in tenns of the required router configuration. The 
routing protocol EIGRP utilizes a loop-free routing algorithm called DUAL, which is 

in 

P based on internodal coordination tliat can span multiple hops. 

H In addition to DUAL, several algorithms based on distance vectors have been 

ife proposed to overcome the counting-to-infinity problem of DBF. All of these algorithms 

if-' 

P rely on exchanging queries and replies along multiple hops, a technique that is 

sometimes referred to as diffusing computations, because it has its origin in Dijkstra and 
Scholten's basic algorithm. 

A couple of routing algorithms have been proposed that operate using partial 

20 topology information to eliminate the main limitations of topology-broadcast algorithms. 
Furthermore, several distributed shortest-path algorithms have been proposed that use 
the distance and second-to-last hop to destinations as the routing information 
exchanged among nodes. These algorithms are often called path-finding algorithms or 
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source-tracing algorithms. All of these algorithms eliminate DBF's counting to infinity 
problem, and some of them are more efficient than any of the routing algorithms based 
on link-state information proposed to date. Furthermore, LPA is maintained loop-free at 
every instant. 

With the exception of DASM, all of the above routing algorithms focus on the 
provision of a single path to each destination. A drawback of DASM, however, is that it 
uses multi-hop synchronization, which limits its scalability. Recently a routing protocol 
referred to MPDA has been proposed which is a method based on link-states that 
provides multiple loop-free path routing utilizing one-hop synchronization. 

Therefore, a need exists for a routing protocol which is scalable, and provides 
multipath unequal cost routing based on distance vectors, which is assured to be loop- 
free. The present invention satisfies those needs, as well as others, and overcomes the 
deficiencies of previously developed protocols. 

BRIEF SUMMARY OF THE INVENTION 

The present invention is a routing method that determines multiple loop-free 
paths between source and destination pairs, which utilizes shortest distances and 
predecessor information in its route computation. A variant of MPDA is herein 
presented which is referred to as MPATH, which is a routing algorithm based on 
distance vectors that: (a) provides multiple paths of unequal cost to each destination 
that are free of loops at every instant, both in steady state as well as during network 
transitions, and (b) utilizes a synchronization mechanism that spans only one hop, 
which makes it more scalable than routing algorithms based on diffusing computations 
spanning multiple hops. MPATH is a path-finding algorithm, and differs from prior 
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similar algorithms in the invariants used to ensure multiple loop-free paths of unequal 
cost. The peculiar differences between MPATH and MPDA is a result of the differences 
in the kind of information that nodes exchange. 

An object of the invention is to provide a routing protocol for computing multiple 
routes of unequal cost. 

Another object of the invention is to provide a routing protocol in which the routes 
are assured to be loop-free at every instant. 

Another object of the invention is to provide a routing protocol that does not 
require internodal synchronization which spans greater than a single hop. 

Another object of the invention is to provide a routing protocol of low complexity, 
wherein the requirements for storage, time, computation, and communication are 
minimized. 

Another object of the invention is to provide a routing protocol which always 
converges to a shortest distant route. 

Further objects and advantages of the invention will be brought out in the 
following portions of the specification, wherein the detailed description is for the purpose 
of fully disclosing preferred embodiments of the invention without placing limitations 
thereon. 

BRIEF DESCRIPTION OF THE DRAWINGS 
The invention will be more fully understood by reference to the following 

drawings which are for illustrative purposes only: 

FIG. 1 is pseudocode for a path algorithm according to an aspect of the present 

invention, shown with an initialization procedure and a path routing algorithm to each 
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destination. 

FIG. 2 is pseudocode for a neighbor table update algorithm according to an 
aspect of the present invention. 

FIG. 3A is a topology diagrann within which table updates are exemplified 
according to an aspect of the present invention, shown with adjacent links and neighbor 
tables. 

FIG. 3B is a topology diagram with a distance table for illustrating the table 
update procedure within an aspect of the present invention, shown with a table of 
preferred neighbors. 

FIG. 4A is a topology diagram which exemplifies tie-breaking rules according to 
an aspect of the present invention, shown with unit link costs. 

FIG. 4B is a topology diagram which exemplifies tie-breaking rules according to 
an aspect of the present invention, shown with costs of adjacent links and shortest-path 
trees of neighboring nodes. 

FIG. 4C is a topology diagram which exemplifies tie-breaking rules according to 
an aspect of the present invention, showing a tie-break resolution. 

FIG. 5 is pseudocode which exemplifies updating of the main table according to 
an aspect of the present invention. 

FIG. 6 is pseudocode which exemplifies multipath routing according to an aspect 
of the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

Referring more specifically to the drawings, for illustrative purposes the present 
invention is embodied in the apparatus and methods generally shown in FIG. 1 through 
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FIG. 6. It will be appreciated tiiat the apparatus may vary as to configuration and as to 
details of the parts, and that the nnethod may vary as to the specific steps and 
sequence, without departing from the basic concepts as disclosed herein. 
1. Distributed Multipath Routing Algorithm 
5 1.1. Problem Formulation 

A computer network is represented as a graph G = {N,L) where is the set of 

nodes, typically routers, and L is the set of edges, links, connecting the nodes within 
the network. A cost is associated with each link that can change overtime, but is 
always positive. Two nodes connected by a link are called adjacent nodes or 
in neighbors. The set of all neighbors of a given node / is denoted by , Adjacent 
nodes communicate with each other using messages and messages transmitted over 

i ^ 

an operational link are received with no errors, in the proper sequence, and within a 
finite timeframe. Furthermore, such messages are processed by the receiving node 
one at a time in the order received. A node detects the failure, recovery and link cost 
IS changes of each adjacent link within a finite time. 

^ The goal of the present distributed nDuting algorithm is to determine at each node 

/ the successor set of / for destination j , which we denote by Sj (t) ^ A'^' , such that 

the routing graph SGj {t) consisting of link set {{m,n) \neSf{t), me n] is free of loops 
at every instant t, even when link costs are changing with time. The routing graph 
20 SGj (t) for single-path routing is a sink-tree rooted at j , because the successor sets 

Sj{t) have at most one member. In multipath routing, there can be more than one 
member in Sj{t) therefore, SGj (t) is a directed acyclic graph with j as the sink node. 
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There are potentially several SGj (t) for each destination J ; however, the routing graph 
we are interested Is defined by the successor sets Sj (t) = [k | Dj [t) < D) {t),k^ A'^'} , 
where D) is the shortest distance of node / to destination j , which is referred to as a 
shortest multipath routing graph for destination J . 

After a series of link cost changes which leave the network topology In arbitrary 
configuration, the distributed routing algorithm should work to modify SGj in such a way 

that it eventually converges to the shortest multipath of the new configuration, without 
ever creating a loop in SGj during the process. 

Since D* is a local variable of node k , Its value has to be explicitly or implicitly 
communicated to node / . If D'j^ is the value of D* as known to node / , the problem 
now becomes one of computing S'j (t) = {k\D'ji^ (t) < D] {t)] . However, because of non- 
zero propagation delays during network transitions, discrepancies can exist in the value 
of Dj and its copy D'j^ at / , which may cause loops to form in SGj (t) . To prevent 

loops, therefore, additional constraints must be imposed when computing S'j. If the 
successor set at each node i for each destination j satisfies certain conditions called 
loop-free Invariant conditions, then the snapshot at time t of the routing graph SGj (t) 
implied by Sj{t) is free of loops. The solution within the present invention solves this 
problem in two parts: (1) computing Dj using a shortest-path routing algorithm called 
PATH, and (2) extending it to compute S'j such that they satisfy loop-free invariant 
conditions at every instant. 
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1.2. Node Tables and Message Structures 



As in DBF, nodes executing MPATH exchange messages containing distances 



to destinations. In addition to the distance to a destination, nodes also exchange the 



identity of the second-to-last node, also called predecessor node, which is the node just 



5 before the destination node on the shortest path. In this respect MPATH is similar to 



several prior algorithms but differs in its specification, verification and analysis and, 



more importantly, in the multipath operation described in the next section. 



The following information is maintained at each node: 



1 . A Main Distance Table that contains Z)j and p'j , where Dj is the distance 



1|| of node i to destination j and /? j is the predecessor to destination j on the shortest 



path from i to j . The table also stores for each destination j , the successor set S'- , 
W feasible distance FD], reported distance RD], and two flags "c/?angeGf"and "report-it". 



2. A Mam Link Table f that is the node's view of the network and contains 



iJf links represented by (m, n, d) where (m, n)\sa link with cost d . 
1]S 3. A Neigiibor Distance Table for neighbor k containing Dj^ and p^^ where 



Dji^ is the distance of neighbor k to j as comnnunicated by k , and is the 



predecessor to j on the shortest path from k to j as notified by k . 



4. A Neighbor Link Table Tl containing the view that neighbor k has of the 



network as known to / and contains link information derived from the distance and 



20 predecessor information in the neighbor distance table. 



5. An Adjacent Link Table that stores the cost l[ of adjacent link to each 
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neighbor k. If a link is down its cost is infinity. 

Nodes exchange information using update messages which have the following 

format: 

1 . An update message can one or more update entries. An update entry is a 
triplet \ j,d,p\,\N\\eve d is the distance of the node sending the message to destination 
j and p is the predecessor on the path to j ; and 

2. Each message candies two flags used for synchronization: query and reply. 

1.3. Computing D] 

As mentioned earlier, the strategy within the present invention is to first design a 
shortest-path routing algorithm and then make the multipath extensions to it. This 
subsection describes our shortest-path algorithm PATH and the next subsection 
describes the multipath extensions. FIG. 1 illustrates pseudocode for an example of the 
PATH procedure. INIT-PATH is called at node startup to initialize the tables, distances 
are initialized to infinity and node identities are initialized to a null value. PATH is 
executed in response to an event that can be either a receipt of an update message 
from a neighbor, or detection of an adjacent link cost or link status (up/down) change. 
PATH Invokes procedure NTU, described in FIG. 2, which first updates the neighbor 
distance tables and then updates Tl with links (m, n, d) where d = D'^ - D'^ and 

m = plk- PATH then invokes procedure MTU, specified in FIG. 5, which constructs T' 

by merging the topologies r/ and the adjacent links . 
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FIG. 3A and FIG. 3B illustrate updating of the main table. FIG. 3A depicts 
adjacent links and neighbor tables of node / , while FIG, SB depicts preferred neighbors 
along with the main link table of node / after merging the neighbor tables. 

The merging process is straightforward if all neighbor topologies r/ contain 
consistent link information, but when two or more neighbors link tables contain 
conflicting information regarding a particular link, the conflict must be resolved. Two 
neighbor tables are said to contain conflicting information regarding a link, if either both 
report the link with different cost or one reports the link and the other does not. 
Conflicts are resolved as follows: if two or more neighbor link tables contain conflicting 
information of link (m, n), then T is updated with link information reported by the 
neighbor k that offers the shortest distance from the node / to the head node m of the 
link, such as ll + D]^^ = min\^ll + Dl^ \ k e N'^ . Ties are broken in a consistent manner; 

one way is to break ties always in favor of lower address neighbor. Because / itself is 
the head of the link for adjacent links, any information about an adjacent link supplied by 
neighbors will be overridden by the most current information about the link available to 
node / . 

FIG. 4A through FIG. 4C shows the significance of the tie-breaking rule. FIG. 4A 
depicts an example network topology with unit link costs. FIG. 4B illustrates node i 
having the costs of its adjacent links and the shortest path trees of its neighbors p and 
q. The distances of nodes x and y from / is identical through both neighbors p and 
q . FIG. 4C illustrates that if MTU breaks ties in an arbitrary manner while constructing 
P , it may choose p as the preferred neighbor for node x and choose q as the 
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preferred neighbor for node y , which results in a graph that has no path from / to j . It 
will be appreciated, therefore, that ties should not be broken in an arbitrary manner. 

After merging the topologies, IVITU runs Dijkstra's shortest path algorithm to find 
the shortest path tree and deletes all links from f that are not in the tree. Because 
there can be more than one shortest-path tree, while running Dijkstra's algorithm ties 
are again broken in a consistent manner. The distances D'. and predecessors p) can 

then be obtained from T . The tree is compared with the previous shortest path tree 
and only the differences are then reported to the neighbors. If there are no differences, 
no updates are reported. Eventually all tables converge such that D] yields the 
shortest distances and all message activity ceases. 
1.4. Computing S] 

In this subsection, the final desired routing algorithm MPATH is derived by 
making extensions to PATH. MPATH computes the successor sets S] by enforcing the 

Loop-free Invariant conditions described below and using a neighbor-to-neighbor 
synchronization. 

Let an "estimate" of the distance of node / to node j , be referred to as the 
feasible distance, FDj; in a similar manner as FZ)j is equal to £>] when the network is in 
a stable state, but to prevent loops during periods of network transitions, it is allowed to 
temporarily differ from Dj . Loop-free invariant conditions can be expressed as follows: 



(1) 



= {k\D^,{t)<Fmt)} 



(2) 
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The invariant conditions (1) and (2) state that, for each destination j , a node / 
can choose a successor whose distance to j , as known to i , is less than the distance 
of node i to j that is known to its neighbors. 

Theorem 1 : If the LFI conditions are satisfied at any time t , the SGj (t) implied by 

the successor sets Sj {t) is loop-free. 

Proof: Let keS] {t) then from Eq. 2 it follows that: 

D^,{t) < FDiit) (3) 
At node k , because node i is a neighbor, from Eq. 1 above, it follows that: 

FD'j{t) < D^,{t) (4) 
Combining Eq. 3, and Eq. 4, it follows that: 

F&j{t) < FD){t) (5) 
Eq. 5 states that, if A is a successor of node i in a path to destination j , then k's 
feasible distance to j is strictly less than the feasible distance of node i to node j . 
Now if the successor sets define a loop at time t with respect to node j , then for some 
node p on the loop, an absurd relation is arrived at wherein FDf[t)<FD^{t), 

Therefore, the LFI conditions are sufficient for loop-freedom. 

The invariants used in LFI are independent of whether the algorithm uses link 
states or distance vectors; in link-state algorithms, such as MPDA, the Z))^^ are 

computed locally from the link-states communicated by the neighbors while in distance- 
vector algorithms, like the MPATH presented here, the D'jj^ are directly communicated. 
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The invariants (1 ) and (2) suggest a technique for computing S'j [t) such that the 
successor graph SGj [t) for destination j is loop-free at every instant. The key is 
detennining FD] [t) in Eq. (1 ), which requires node i to know £)*. [t] , the distance from 
node i to node j in the topology table that node / communicated to neighbor k. 
As a result of non-zero propagation delays, 2]* is a time-delayed version of T' . It will be 
appreciated that, if node / delays updating of FDi with until k incorporates the 
distance £)) in its tables, then FDj satisfies the LFI condition. 

FIG. 6 exemplifies pseudocode for MPATH which enforces the LFI conditions by 
synchronizing the exchange of update messages among neighbors using query and 
reply flags. If a node sends a message with a query bit set, then the node must wait 
until a reply is received from all its neighbors before the node is allowed to send the 
next update message. The node is said to be in ACTIVE state during this period. The 
inter-neighbor synchronization used in MPATH spans only one hop, unlike algorithms 
that use diffusing computation that potentially span the whole network, such as DASM. 

Assume that all nodes are in a PASSIVE state initially with correct distances to 
all other nodes and that no messages are in transit or pending to be processed. The 
behavior of the network where every node runs MPATH is such that when a finite 
sequence of link cost changes occurs in the network within a finite time interval, some 
or all nodes to go through a series of PASSIVE-to-ACTIVE and ACTIVE-to-PASSIVE 
state transitions, until eventually all nodes become PASSIVE with correct distances to 
all destinations. 
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Let a node in PASSIVE state receive an event resulting in changes in its 
distances to some destinations. Before the node sends an update message to report 
new distances, it checks if the distance D'j to any destination j has increased above 

the previously reported distance RD'j . if none of the distances increased, then the node 

remains in PASSIVE state. Otherwise, the node sets the query flag in the update 
message, sends it, and goes into ACTIVE state. When in ACTIVE state, a node cannot 
send any update messages or add neighbors to any successor set. After receiving 
replies from ail its neighbors, the node is allowed to modify the successor sets and 
report any changes that may have occurred since the time it has transitioned to ACTIVE 
state, and if none of the distances increased beyond the reported distance, the node 
transitions to PASSIVE state. Othenwise, the node sends the next update message with 
the query bit set and becomes ACTIVE again, and the whole cycle repeats. If a node 
receives a message with the query bit set when in PASSIVE state, it modifies its tables 
and then sends back an update message with the reply flag set. Otherwise, if the node 
happens to be in ACTIVE state, it modifies the tables but because the node is not 
allowed to send updates when in ACTIVE state, the node sends back an empty 
message with no updates but the reply bit set. If a reply from a neighbor is pending, 
when the link to the neighbor fails then an implicit reply is assumed, and such a reply is 
assumed to report an infinite distance to the destination. Because replies are given 
immediately to queries and replies are assumed to be given upon link failure, deadlocks 
due to inter-neighbor synchronization cannot occur. Eventually, all nodes become 
PASSIVE with con-ect distances to destinations. 
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2. Correctness of MPATH 

The following properties of MPATH are to be proven: (1) MPATH eventually 
converges with Dj ; giving the shortest distances and (2) the successor graph SGj is 

loop-free at every instant and eventually converges to the shortest multipath. PATH 
5 works essentially like PDA except that the kind of update information exchanged is 
different; PDA exchanges link-state while PATH exchanges distance-vectors with 
predecessor infornnation. The correctness proof of PATH is identical to PDA and are 
reproduced here for correctness. The convergence of MPATH directly follows from the 
convergence of PATH because extensions to MPATH are such that update messages 
1|| in MPATH are only delayed a finite amount of time. 

\l Definitions: The n-hop minimum distance of node / to node j in a network is the 

W minimum distance possible using a path of n hops, (links) or less. A path that offers the 

m 

:f n-hop minimum distance is called «-hop minimum path. If there is no path with n hops 

Q or less from node / to j then the n-hop minimum distance from / to 7 is undefined. 

N 

li An n -hop minimum tree of a node / is a tree in which node / is the root and all paths of 
n hops or less from the root to any other node is an n -hop minimum path. 

Let G denote the final topology of the network, as would be seen by an 
omniscient observer after all link changes have occurred. Without loss of generality, 
assume G is connected; if G is disconnected, the proof applies to each connected 
20 component independently. 

It is presumed that a router / knows at least the n -hop minimum tree, if the tree 
contained in its main link table T is at least an n-hop minimum tree rooted at / in G , 
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and there are at least n nodes in r that are reachable from the root / . Note that r is 
such that the links with head nodes that are more than n hops away from i may have 
costs that do not agree with the link costs in G . 

Theorem 2: If node i has adjacent link costs that agree with G and for each 
neighbor k , r/ represents at least an (n-lj-hop minimum tree, then after the execution 
of MTU, the minimum cost tree contained in r is at least an n -hop minimum tree. 
Proof: Let Hi denote an n -hop minimum tree rooted at node / in G and let 

Ml be the set of nodes that are within n hops from / in Hi, Let Df denote the 
distance of / to J in Hi. Let dg be the cost of the link / -> J . Node i is called the 
head of the link i j . The notation / j indicates a path from i to j of zero or more 
links; if the path has zero links, then / = j . The length of path / f-> j is the sum of 
costs of all links in the path. 

Property 1 : From the principle of optimality (the sub-path of a shortest path 
between two nodes is also the shortest path between the end nodes of the sub-path), if 
H and H' are two «-hop minimum trees rooted at node / and M and M' are sets of 
nodes that are within n hops from i in H and H' respectively, then M ~ M' - Ml 
and Ml > n. For each j e Ml the length of path / j in both H and H' is equal 
to . For A > K , Df < Df . 

Let ^'^[Ji^^j^iA » where 4 is the set of nodes in r/ . Because r/ is at least an 
(n-1)-hop minimum tree and node i can appear at most once in each of 4 » ®ach 4 
has at least «-1 unique elements. Therefore, A' has at least n-l elements. 

UCOO-355-2 16 EL645676945US 



Let Ml be the set of n-^ nearest elements to node / in A' , That is, Ml c A' , 
\mI\ = n'^, and for each j e Ml, and veA^ -M^ , 

To prove the theorem it is sufficient to prove the following: 
5 1 . Let Gl represent the graph constructed by MTU on lines 2 and 3. (i.e., before 

applying Dijkstra in line 4). For each j e Ml there is a path / j in Gl such that its 

length is at most Z)f . 

2. After running Dijkstra on Gl on line 4 in MTU, the resulting tree is at least an n- 
k§ hop minimum tree. 

Let us first assume part 1 is true and prove part 2. Frcim the statement in part 1 
yl for each node j e Ml there is a path / 7 in with length at most Dy In the 
^ resulting tree after running Dijkstra, we can infer there is a path i h> j with length at 
most £)f . Because there are w-1 nodes in Ml , the tree constructed has at least n 

CI nodes including node i . From property 1 , it follows that the tree constructed is at least 

pi 

15 an n-hop minimum tree. 

To prove part 1, order the nodes in Ml in non-decreasing order. The proof is by 

induction on the sequence of elements in Ml . The base case is true because for , 

the first element of M' , l^^ = min[ll\kGN'] and = DJ'""! . As induction hypothesis, 

let the statement hold for the first m - 1 elements of Ml . Consider the m^^ element j 
20 ^ Ml, Let K be the highest priority neighbor for which I>5^ + 4 = 



1^ 
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TnmjDj^ + /^ IkeN^y At most m - 1 nodes in can have lesser or equal distance than 
j which implies path K\-^ j exists with at most m -1 hops. Let v be the neighbor of j 
in 7^ . Then the path K\-^v ^ j has at most m -1 hops. Because is at least a (n- 
1)-hop minimum tree, the link v -> y must agree with G . Since D^^ + <Djj^ + , 

from the induction hypothesis there is a path i v in G , such that the length is at 
most D'^\ 

n 

The following now shows that the preferred neighbor for v is also K , so that the 
link V 7 will be included in the construction of Gl thus ensuring the existence of the 

path / H> j in G^ . If some neighbor K' other than K is the preferred neighbor for v 

then one of the following two conditions should hold: (a) D^f^r + 1^ < + or (b) 

^Ik' 4' ~ ^Ik 4 priority of K' is greater than priority of K. 

Case (a): Because + 4 - ^jr ^k' follows that the path v i-> j in T^, , is 
greater than cost of v -> y in G which implies that r^,, is not an (n-l) hop minimum 
tree, which contradicts the assumption. Therefore D^^ + 4 = min\Dl^ \ k^N'^ . 

Case (b): Let Qj be the set of neighbors that give the minimum distance for j , 
such as for each k e Qj, Dj^ + C " min\^D)^ +4 I ^ ^ • Similarly, let be such that 
for each k g Q^, Dl^ + = min[iy^^^l[\kGN']. \i k e and k g ey,then it 
follows from same argument as in case (a) that v 7 in r/ is greater than cost of v 
^ 7 in G implying is not a (n-1) hop minimum tree, which again is a contradiction 

UCOO-355-2 18 EL645676945US 



of the assumption. Because K has the highest priority among all members of Qj and 
Qv ^ Qj and keQ^, K also has the highest priority among all members of 2v 
Therefore c Qj . Also, from the same argument it can be inferred that KeQ^. This 
proves that v -» 7 will be included in the construction of G'^ . Because D'/ + d^- = 

5 Z)^'-^ in G , where d^j is the final cost of link v -» 7 , and length of j v in &„ less than 
or equal to Z)^'^ from the induction hypothesis, the length of / 1-^ v in Gl is less than or 
equal to Z)^'^ . This proves part 1 of the theorem. 

Theorem 3: A finite time after the last link cost change in the network, the main 

P 

% topology T' at each node / gives the connect shortest paths to all known destinations. 
IP Proof: The proof is identical to the proof of Theorem 2 and is performed by 

U induction on , the global time when for each node i, P is at least an n -hop minimum 
H tree. Because the longest loop-free path in the network has at most N - 1 links where 
H N is number of nodes in the network, t^_^ is the time when every node has the shortest 
P path to every other node, wherein t^^^ should be shown to be finite. The base case of 
15 t^_^ is , the time when every node has a one-hop minimum distance and because the 
adjacent link changes are notified within finite time, < oo . Let < co for some n < 

N . Given that the propagation delays are finite, each node will have each of its 
neighbors n -hop minimum tree in finite time after . From Theorem 2 we can see that 

the node will have at least the (n +1)-hop minimum tree in finite time after . 

20 Therefore, t^^^ < oo. From induction it will be appreciated that t^_^ < co. 
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A node generates update messages only to report changes in distances and 
predecessor, so after convergence no messages will be generated. The following 
theorems show that MPATH provides instantaneous loop-freedom and correctly 
computes the shortest multipath, 
5 Theorem 4: For the algorithm MPATH executed at node / , let be the time when 

RD'j is updated and reported for the n^^ time. Then, the following conditions always 
hold: 

FD){t,) < mm{iJZ);(^,_0.^;(^«)) (6) 
a ^(0 ^ FD){t,) t^[t,j^,,] (7) 

1 J Proof: From the working of MPATH in FIG. 6, it is observed that RD) is 

U1 updated at line 3c when (a) the node goes from PASSIVE-to-ACTIVE because of one or 

HI 

IB more distance increases; (b) the node receives the last reply and goes from ACTIVE-to- 

I; PASSIVE state; (c) the node is in PASSIVE state and remains in PASSIVE state 

q because the distance did not increase for any destination; and (d) the node receives the 

p last reply but immediately goes into ACTIVE state. The reported distance RD) remains 

unchanged during the ACTIVE phase. Because FD'j is updated at line 3a each time 
RD'j is updated at line 3c, Eq. (6) follows. When the node is in ACTIVE phase, FD) 
may also be modified by the statement on line 3f, which implies Eq. (7), 
Theorem 5: The safety property; at any time / , the successor sets S'j [t) which are 
20 computed by MPATH are loop-free. 
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Proof: The proof is based on showing that the FDj and Sj computed by 

MPATH satisfy the LFI conditions. Let , be the time when RDj is updated and 
reported for the n^^ time. The proof is by induction on the interval Let the LFI 

condition be true up to time , we show that: 

FD){t) < te[t„,t„,,] (8) 

From Theorem 4 we have: 

FDiM ^ min{RD'j{t„_,),RD'.(t„)} (9) 

FD^{t„,^ < min{RD'j{t„),RD'j{t„,,)} (10) 

FD'j{t) < FDi{t,) t^{t„,t„,,] (11) 

1!^ Combining the above equations we arrive at: 

m 

g FD^it) < (12) 

Let f be the time when a message sent by i at is received and processed by 
neighbor k . Because of the non-zero propagation delay across any link, f is such that 
tn < < tn+\ and because RD'j is modified at and remains unchanged in {t^j^^i) it 
1 5 follows that: 



RD](t„_,) < Dl{t) te[t„,t'] (13) 

^j{tn) ^ D],(t) t^[t,,t^,,\ (14) 

FromEq. (13) and (14): 

min{RD)(t,_,),RD)(t:)\ < D',{t) te[t„,t„,,] (15) 

20 From (12) and (15) the inductive step (8) follows. Because FD'j [t^) < Di{tQ) at 
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initialization, from induction it is known that FD){t) < Dj.(t^) for all t. Given that the 
successor sets are computed based on FD'j {t) , it follows that the LFI conditions are 
always satisfied. According to the Theorem 1 this implies that the successor graph SGj 
is always loop-free. 

Theorem 6: Liveness property; a finite time after the last change in the network, the 
Dj gives the correct shortest distances and Sj = \ Dj < Dj, k^N'^ 

Proof: The proof is similar to the proof of Theorem 4. The convergence of 

MPATH follows directly from the convergence of PATH because the update messages 
in MPATH are only delayed a finite time as allowed at line 4 in algorithm PATH. 
Therefore, the distances D] in MPATH also converge to shortest distances. Because 

changes to Dj are always reported to the neighbors and are incorporated by the 

neighbors in their tables in finite time Djj^ = Dj , for A: e N' after convergence. From 

line 3a in MPATH, it is observed that when node i becomes passive FDj = Dj holds 

true. Because all nodes are passive at convergence it follows that Sj = 

{k\Di,<FDi, kG N'] = {k I D) < Z)), k e N'} . 

3. Complexity Analysis 

The main difference between PATH and MPATH is that the update messages 
sent in MPATH are delayed a finite amount of time in order to enforce the invariants. As 
a result, the complexity of PATH and MPATH are essentially the same and are 
therefore collectively analyzed. 
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The storage complexity is the amount of table space needed at a node. Each 
one of the A^' neighbor tables and the main distance tables has size of the order 

and the main link table T' can grow, during execution of MTU, to size at most | N' \ 
times 0(1^^1) . The storage complexity is therefore of the order oi^N^ j . 

The time complexity is the time it takes for the network to converge after the last 
link cost change in the network. To determine time complexity it is assumed that the 
computation time is negligible in comparison with the communication time. If is the 
time when every node has the n-hop minimum tree, because every node processes 
and reports changes in finite time \ t^^^ - tn\\s bounded. Let \t^^^ - \ < 6^ for some 
finite constant 0. From theorem 3, the convergence time can be at most \ N\6 and, 
hence, the time complexity is 0(|A^|). 

The computation complexity is the time taken to build the node's shortest path 
tree in T from the neighbor tables . Updating of r with information is 

0{^N' \n I) operation and running Dijkstra on r takes oi^N' \n \log{\^N |)). Therefore 

15 the computational complexity is o| A'"' jA'' + N' N log(^N . 

The communication complexity is the number of update messages required for 
propagating a set of link-cost changes. The analysis for multiple link-cost-changes is 
complex because of the sensitivity to the timing of the changes. So, therefore the 
analysis is provided only for the case of a single link-cost change. A node removes a 
20 link from its shortest path tree if only a shorter path using two or more links is 

discovered and the path is stored. Therefore, a removed link will not be added again to 
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the shortest path which means that a link can be included and deleted from the shortest 
path by a node at most one time. It will be appreciated that since nodes report each 
change only once to each neighbor, that an update message can travel only once on a 
given link and therefore the number of messages sent by a node can be at most 0{\e\) . 

For certain topologies and sensitively timed sequences of link cost changes the 
amount of communication required by PATH can be exponential. One industry example 
(Humblet) exhibits such behavior, and though PATH is different from the shortest-path 
algorithm utilized therein, it should be noted that PATH is not immune from such 
exponential behavior. However, it appears that such scenarios would require 
sensitively timed link-cost changes which are very unlikely to occur in practice. If 
necessary, a small hold-down time before sending update messages may be used to 
prevent such behavior. 

Accordingly, it will be seen that this invention provides a routing algorithm based 
on distance information that provides multiple paths that need not have equal costs and 
that are loop-free at every instant, without requiring inter-nodal synchronization 
spanning more than one hop. The loop-free invariant conditions presented here are 
quite general and can be used with existing internet protocols. The multiple successors 
that MPATH makes available at each node can be used for traffic load-balancing, which 
is necessary for minimizing delays in a network as has been shown using other 
algorithms, such as MPDA. MPATH can therefore be used as an alternative to MPDA 
to get similar performance. 

Although the description above contains many specificities, these should not be 
construed as limiting the scope of the invention but as merely providing illustrations of 
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some of the presently preferred embodiments of this invention. Therefore, it will be 
appreciated that the scope of the present invention fully encompasses other 
embodiments which may become obvious to those skilled in the art, and that the scope 
of the present invention is accordingly to be limited by nothing other than the appended 
claims, in which reference to an element in the singular is not intended to mean "one 
and only one" unless explicitly so stated, but rather "one or more." All structural, 
chemical, and functional equivalents to the elements of the above-described preferred 
embodiment that are known to those of ordinary skill in the art are expressly 
incorporated herein by reference and are intended to be encompassed by the present 
claims. Moreover, it is not necessary for a device or method to address each and every 
problem sought to be solved by the present invention, for it to be encompassed by the 
present claims. Furthermore, no element, component, or method step in the present 
disclosure is intended to be dedicated to the public regardless of whether the element, 
component, or method step is explicitly recited in the claims. No claim element herein is 
to be construed under the provisions of 35 U.S.C. 112, sixth paragraph, unless the 
element is expressly recited using the phrase "means for." 



UCOO-355-2 



25 



EL645676945US 



